智能论文笔记

Variance of Twitter Embeddings and Temporal Trends of COVID-19 cases

Mayank Sethi , Ambika Sadhu , Khushbu Pahwa , Sargun Nagpal , Tavpritesh Sethi

分类：自然语言处理

2021-09-30

冠状病毒大流行的严重程度需要有效的行政决定。在印度超过4万人的人屈服于Covid-19，拥有超过3亿卢比的确认案例，仍然计数。合理的第三波的威胁继续困扰数百万。在这种不断变化的病毒动态中，预测性建模方法可以用作整体工具。大流行进一步引发了一个前所未有的社交媒体使用。本文旨在提出一种利用社交媒体，特别推特的方法来预测与Covid-19案件相关的即将发生的情景。在这项研究中，我们寻求了解Covid-19相关推文的潮流如何表明案件的增加。这种前瞻性分析可用于帮助管理员及时资源分配，以减少损坏的严重程度。使用Word Embeddings来捕获推文的语义含义，我们识别大量尺寸（SDS）.Or方法，预测患情况的上升时间为15天，30天，R2分别为0.80和0.62。最后，我们解释了SDS的主题效用。

translated by 谷歌翻译

NFResNet: Multi-scale and U-shaped Networks for Deblurring

Tanish Mittal , Preyansh Agrawal , Esha Pahwa , Aarya Makwana

分类：计算机视觉

2022-12-12

Multi-Scale and U-shaped Networks are widely used in various image restoration problems, including deblurring. Keeping in mind the wide range of applications, we present a comparison of these architectures and their effects on image deblurring. We also introduce a new block called as NFResblock. It consists of a Fast Fourier Transformation layer and a series of modified Non-Linear Activation Free Blocks. Based on these architectures and additions, we introduce NFResnet and NFResnet+, which are modified multi-scale and U-Net architectures, respectively. We also use three different loss functions to train these architectures: Charbonnier Loss, Edge Loss, and Frequency Reconstruction Loss. Extensive experiments on the Deep Video Deblurring dataset, along with ablation studies for each component, have been presented in this paper. The proposed architectures achieve a considerable increase in Peak Signal to Noise (PSNR) ratio and Structural Similarity Index (SSIM) value.

translated by 谷歌翻译

DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition

Santosh Kumar Yadav , Achleshwar Luthra , Esha Pahwa , Kamlesh Tiwari , Heena Rathore , Hari Mohan Pandey , Peter Corcoran

分类：计算机视觉

2022-12-07

Human activity recognition (HAR) using drone-mounted cameras has attracted considerable interest from the computer vision research community in recent years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoints, and the environmental scenarios where the action is taking place. To address such complexities, in this paper, we propose a novel Sparse Weighted Temporal Attention (SWTA) module to utilize sparsely sampled video frames for obtaining global weighted temporal attention. The proposed SWTA is comprised of two parts. First, temporal segment network that sparsely samples a given set of frames. Second, weighted temporal attention, which incorporates a fusion of attention maps derived from optical flow, with raw RGB images. This is followed by a basenet network, which comprises a convolutional neural network (CNN) module along with fully connected layers that provide us with activity recognition. The SWTA network can be used as a plug-in module to the existing deep CNN architectures, for optimizing them to learn temporal information by eliminating the need for a separate temporal stream. It has been evaluated on three publicly available benchmark datasets, namely Okutama, MOD20, and Drone-Action. The proposed model has received an accuracy of 72.76%, 92.56%, and 78.86% on the respective datasets thereby surpassing the previous state-of-the-art performances by a margin of 25.26%, 18.56%, and 2.94%, respectively.

translated by 谷歌翻译

DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection

Ziyuan Zhao , Mingxi Xu , Peisheng Qian , Ramanpreet Singh Pahwa , Richard Chang

分类：计算机视觉 | 人工智能

2022-12-05

Deep learning has achieved notable success in 3D object detection with the advent of large-scale point cloud datasets. However, severe performance degradation in the past trained classes, i.e., catastrophic forgetting, still remains a critical issue for real-world deployment when the number of classes is unknown or may vary. Moreover, existing 3D class-incremental detection methods are developed for the single-domain scenario, which fail when encountering domain shift caused by different datasets, varying environments, etc. In this paper, we identify the unexplored yet valuable scenario, i.e., class-incremental learning under domain shift, and propose a novel 3D domain adaptive class-incremental object detection framework, DA-CIL, in which we design a novel dual-domain copy-paste augmentation method to construct multiple augmented domains for diversifying training distributions, thereby facilitating gradual domain adaptation. Then, multi-level consistency is explored to facilitate dual-teacher knowledge distillation from different domains for domain adaptive class-incremental learning. Extensive experiments on various datasets demonstrate the effectiveness of the proposed method over baselines in the domain adaptive class-incremental learning scenario.

translated by 谷歌翻译

SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition

Santosh Kumar Yadav , Esha Pahwa , Achleshwar Luthra , Kamlesh Tiwari , Hari Mohan Pandey , Peter Corcoran

分类：计算机视觉

2022-11-10

Drone-camera based human activity recognition (HAR) has received significant attention from the computer vision research community in the past few years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoints, and the environmental scenarios where the action is taking place. To address such complexities, in this paper, we propose a novel Sparse Weighted Temporal Fusion (SWTF) module to utilize sparsely sampled video frames for obtaining global weighted temporal fusion outcome. The proposed SWTF is divided into two components. First, a temporal segment network that sparsely samples a given set of frames. Second, weighted temporal fusion, that incorporates a fusion of feature maps derived from optical flow, with raw RGB images. This is followed by base-network, which comprises a convolutional neural network module along with fully connected layers that provide us with activity recognition. The SWTF network can be used as a plug-in module to the existing deep CNN architectures, for optimizing them to learn temporal information by eliminating the need for a separate temporal stream. It has been evaluated on three publicly available benchmark datasets, namely Okutama, MOD20, and Drone-Action. The proposed model has received an accuracy of 72.76%, 92.56%, and 78.86% on the respective datasets thereby surpassing the previous state-of-the-art performances by a significant margin.

translated by 谷歌翻译

Model Blending for Text Classification

Ramit Pahwa

分类：机器学习 | 人工智能

2022-08-05

深度神经网络（DNNS）已被证明在各种应用程序中都成功了，例如语音识别和合成，计算机视觉，机器翻译和游戏播放，仅举几例。但是，现有的深度神经网络模型在计算上是昂贵且内存密集型的，阻碍了其在存储器资源低或具有严格延迟要求的应用程序中的部署。因此，一种自然的想法是在深网中执行模型压缩和加速度，而不会显着降低模型性能，这就是我们所谓的降低复杂性。在以下工作中，我们尝试通过将其知识提炼为基于CNN的模型，从而降低自然语言任务的最新模型状态LSTM模型的复杂性，从而减少测试过程中的推理时间（或延迟）。

translated by 谷歌翻译

Pseudo-Poincaré: A Unification Framework for Euclidean and Hyperbolic Graph Neural Networks

Mehrdad Khatir , Nurendra Choudhary , Sutanay Choudhury , Khushbu Agarwal , Chandan K. Reddy

分类：机器学习 | 神经与进化计算

2022-06-09

双曲线神经网络由于对几个图形问题的有希望的结果，包括节点分类和链接预测，因此最近引起了极大的关注。取得成功的主要原因是双曲空间在捕获图数据集的固有层次结构方面的有效性。但是，在非层次数据集方面，它们在概括，可伸缩性方面受到限制。在本文中，我们对双曲线网络进行了完全正交的观点。我们使用Poincar \'e磁盘对双曲线几何形状进行建模，并将其视为磁盘本身是原始的切线空间。这使我们能够用欧几里院近似替代非尺度的M \“ Obius Gyrovector操作，因此将整个双曲线模型简化为具有双曲线归一化功能的欧几里得模型。它仍然在Riemannian歧管中起作用，因此我们称其为伪poincar \'e框架。我们将非线性双曲线归一化应用于当前的最新均质和多关系图网络，与欧几里得和双曲线对应物相比，性能的显着改善。这项工作的主要影响在于其在欧几里得空间中捕获层次特征的能力，因此可以替代双曲线网络而不会损失性能指标，同时利用欧几里得网络的功能，例如可解释性和有效执行各种模型组件。

translated by 谷歌翻译

An Opinion Mining of Text in COVID-19 Issues along with Comparative Study in ML, BERT & RNN

Md. Mahadi Hasan Sany , Mumenunnesa Keya , Sharun Akter Khushbu , Akm Shahariar Azad Rabby , Abu Kaisar Mohammad Masum

分类：神经与进化计算 | 自然语言处理

2022-01-06

全球世界正在穿越大流行形势，这是一个灾难性的呼吸综合征爆发被认为是Covid-19。这是212个国家的全球威胁，即人们每天都会遇到强大的情况。相反，成千上万的受感染的人居住丰富的山脉。心理健康也受到全球冠状病毒情况的影响。由于这种情况，在线消息来源使普通人在任何议程中分享他们的意见。如受影响的新闻相关的积极和消极，财务问题，国家和家庭危机，缺乏进出口盈利系统等。不同的情况是最近在任何地方的时尚新闻。因此，在瞬间内产生了大量的文本，在次大陆领域，与其他国家的情况相同，以及文本的人民意见和情况也是相同的，但语言是不同的。本文提出了一些具体的投入以及来自个别来源的孟加拉文本评论，可以确保插图的目标，即机器学习结果能够建立辅助系统。意见挖掘辅助系统可能以可能的所有语言偏好有影响。据我们所知，文章预测了Covid-19问题上的Bangla输入文本，提出了ML算法和深度学习模型分析还通过比较分析检查未来可达性。比较分析规定了关于文本预测精度的报告与ML算法和79％以及深度学习模型以及79％的报告。

translated by 谷歌翻译

Attention-based Aspect Reasoning for Knowledge Base Question Answering on Clinical Notes

Ping Wang , Tian Shi , Khushbu Agarwal , Sutanay Choudhury , Chandan K. Reddy

分类：自然语言处理

2021-08-01

在过去的几年中，临床笔记中的问题回答（QA）引起了很多关注。临床领域中现有的机器阅读理解方法只能处理有关单个临床文本的问题，并且无法检索有关多个患者及其临床笔记的信息。为了处理更复杂的问题，我们旨在从临床注释中创建知识库，以将不同的患者和临床笔记联系起来，并进行知识基础问题答案（KBQA）。根据N2C2数据集中可用的专家注释，我们首先创建了ClinicalKBQA数据集，其中包括大约9K QA对，并使用300多个问题模板涵盖了有关七个医学主题的问题。然后，我们研究了KBQA的一种基于注意力的方面推理（AAR）方法，并分析了答案的不同方面（例如，实体，类型，路径和上下文）对预测的影响。由于设计精良的编码器和注意力机制，AAR方法可实现更好的性能。从我们的实验中，我们发现这两个方面，类型和路径都使模型能够识别满足一般条件的答案，并产生较低的精度和更高的回忆。另一方面，各个方面，实体和上下文通过特定于节点的信息限制答案，并导致更高的精度和较低的回忆。

translated by 谷歌翻译